Reproducible Research 2.0

Dublin Data Science

Mick Cooney

2019-08-21

What is Reproducible Research?

An Example?

Anecdotes???

Meetups


Dublin Data Science


Insurely You’re Joking (Dublin|London)


Anyone who will have me

Talks


Workshops


Informal help

What To Do?

Reproducibility

Reproducible Research

Aspects


  1. Source Control
  2. Workbooks
  3. Makefiles
  4. Containers and Docker

Source Control

git


Track changes


Collaboration

Issue tracking


Branch management

Workbooks

What is research?

Outcome unknown…

Try lots of stuff…

Record of work

Jupyter vs Zeppelin vs Rmarkdown

NOT for production

Makefiles

Dependency Management

make and Makefiles

Directed Acyclic Graph (DAG)

sysadmin tasks
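
The dependency idea behind make can be sketched in a few lines of Python. This is only an illustration of the DAG concept, not how make itself is implemented: the file names, the `deps` mapping and the timestamps are hypothetical, and real Makefiles compare file modification times on disk rather than numbers in a dict.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Hypothetical project: each target maps to the prerequisites it depends on.
deps = {
    "report.html": {"exploration.Rmd", "clean_data.rds"},
    "clean_data.rds": {"raw_data.csv", "clean.R"},
    "exploration.Rmd": set(),
    "raw_data.csv": set(),
    "clean.R": set(),
}


def build_order(graph):
    """Return the nodes so that every prerequisite precedes its target."""
    return list(TopologicalSorter(graph).static_order())


def stale_targets(graph, mtimes):
    """Return the targets that need rebuilding, in a valid build order.

    A target is stale if it is missing, if any prerequisite is newer than
    it, or if any prerequisite is itself stale.
    """
    stale = []
    for node in TopologicalSorter(graph).static_order():
        prereqs = graph.get(node, set())
        if not prereqs:
            continue  # a source file: nothing to rebuild
        missing = node not in mtimes
        outdated = any(
            p in stale or mtimes.get(p, 0) > mtimes.get(node, 0)
            for p in prereqs
        )
        if missing or outdated:
            stale.append(node)
    return stale


# Hypothetical timestamps: the raw data changed after the cleaned data was built.
mtimes = {
    "raw_data.csv": 5,
    "clean.R": 1,
    "clean_data.rds": 3,
    "exploration.Rmd": 2,
    "report.html": 4,
}

order = build_order(deps)
to_rebuild = stale_targets(deps, mtimes)
```

With these made-up timestamps only the cleaned data and the report are rebuilt; the workbook source and the cleaning script are left alone, which is the saving a Makefile gives over rerunning everything.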

Containers and Docker

Rewind

Quitting from lines 272-288 (10_carinspricing_exploration.Rmd) 
Error in `[.tbl_df`(policyprop_dt, claim_count > 0) : 
  object 'claim_count' not found
Calls: <Anonymous> ... ggplot -> [ -> [.grouped_df -> NextMethod -> [.tbl_df

Execution halted
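
One possible cause of a failure like this is an environment that differs from the one the workbook was written against, though the log alone does not confirm that. Assuming so, a container pins the whole environment; a much lighter complement is to record package versions next to the output. The sketch below is in Python rather than R, and the `snapshot_environment` helper and the package names passed to it are hypothetical.

```python
import sys
from importlib import metadata


def snapshot_environment(packages):
    """Record the interpreter version and the installed version of each package.

    Packages that are not installed are recorded as None rather than
    raising, so the snapshot can always be written.
    """
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return {"python": sys.version.split()[0], "packages": versions}


# Hypothetical package list; use whatever the analysis actually imports.
snapshot = snapshot_environment(["pandas", "numpy"])
```

Saving a snapshot like this with each rendered workbook makes it possible to compare a run that worked with one that did not.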

Summary

Questions?


Email:


GitHub:

https://github.com/kaybenleroll/data_workshops